We study statistics of the knockout tournament, where only the winner of a fixture progresses to the next round. We assign a real number called competitiveness to each contestant and find that the resulting distribution of prize money follows a power law with an exponent close to unity if the competitiveness is a stable quantity and a decisive factor to win a match. Otherwise, the distribution is found to be narrow. The existing observation of power-law distributions in various kinds of real sports tournaments therefore suggests that the rules of those games are constructed in such a way that it is possible to understand the games in terms of the contestants' inherent characteristics of competitiveness.



Competition is a ubiquitous form of social interaction for distributing limited resources among a number of individuals, often regarded as the opposite of cooperation. Competition has been a main tenet in economics, where a perfectly competitive equilibrium is proven to be Pareto-efficient as long as there are no externalities and public goods. Moreover, the notion of natural selection in biological evolution is often understood as proving competition 'natural'. For these reasons, although competition results in growing tension across a society, most people have taken it for granted as an organising principle of our society.

Recently, Deng et al. 1 claimed universal power-law distributions of scores and prize money by observing various kinds of sports such as tennis, golf, football, badminton, and so on. According to their extensive data analysis, the probability to find scores or prize money greater than $k$ always decays as a power law $P_>(k) \sim k^{-(\gamma - 1)}$ with an exponential cutoff, where the power-law exponent $\gamma - 1$ ranges between 0.01 and 0.39 depending on the sport. In addition, they presented a knockout-tournament model to explain the observations. This is an intriguing approach since the most organised forms of competition are usually found in sports. It is also popular to run a knockout tournament, consisting of successive rounds where only the winner of each fixture progresses to the next round, because it is an efficient procedure to find who is the best with a small number of fixtures. In other words, Deng et al. hinted at a direct connection between the structure of competition and its consequences. Physicists have already recognised sports as a fruitful research field: statistics of athletic records have been pioneered by Gembris et al. 2 and Wergen et al. 3, for example, and there have been attempts to even predict the limiting performances in the long run 4. Sports ranking combinatorics has also been considered by Park and Newman 5,6. If we are to understand the dynamics governing high achievements in sports careers, in particular, one famous theory along this direction is called the Matthew "rich get richer" effect 7-9: it says that a higher position leads to a better chance to progress further in a career, resulting in an extremely skewed distribution. The spatial Poisson process to model this effect indeed explains such behaviour with $\gamma < 1$, which is found in some empirical data sets. However, we should point out that many factors of competition are hidden in the probability of progress, and that the stochastic process is totally indifferent to individual characteristics, as written in Ecclesiastes: "the race is not to the swift, but time and chance happeneth to them all".

Figure 1 | Schematic illustration of a tournament with four contestants A, B, C and D. Contestant B has competitiveness $r_B$ and gets prize money $k_B = z^2$ because she has defeated A and C. Likewise, C gets $k_C = z^1$ because she has won only a single match against D.

In this work, we instead focus on statistical analysis of a specific system of competition, i.e., the knockout tournament among inhomogeneous participants. Our main point is that a large part of the statistics is universal in the sense that it is independent of most details of the game but already determined by the tournament structure. Let us consider a player's number of wins, denoted by $n$, for example. When the tournament has finished, the distribution of $n$, denoted by $P(n)$, is always an exponentially decreasing function of $n$. It is a purely geometric property of the tournament tree, independent of any details of the game, loosely mapped to critical percolation on a binary tree 10. If the prize money is highly skewed towards the best players, similarly to real sports tournaments, one can assume that the prize money $k_n$ after winning $n$ rounds is also an exponential function of $n$, that is, $k_n \sim z^n$ (Fig. 1). Combining these two, one finds that the distribution $P(k) \sim k^{-\gamma}$ with

$$\gamma = (\log_2 z)^{-1} + 1, \tag{1}$$
and this mechanism belongs to the combination of exponentials according to Newman 11. If $z$ gets very large, $\gamma$ converges to unity, yielding $P(k) \sim k^{-1}$. As $z \to 1$, on the other hand, $\gamma$ diverges because $P(k)$ approaches the distribution function of $n$, which is an exponential function. In fact, if $z < 2$, the total amount of prize money relative to the winner's prize gets unbounded as the number of contestants grows, which means that the organiser of this tournament has a risk of bankruptcy. This explains why $k_n$ has to be such a rapidly increasing function of $n$, and we see that the feasible range of $\gamma$ is between one and two. Moreover, if there is a typical number of prize winners, $z$ is effectively very large, driving $\gamma$ to unity. This is a simple prediction for a single tournament. In other words, this analysis corresponds to gathering data of prize money distributed over many tournaments without identifying who was who. The actual statistics collected in this way, however, will not be very interesting to us, and it is usually more meaningful to consider individual-based statistics: even for a team sport, each team may be regarded as an individual. It is notable that Deng et al. resolve this problem by introducing the notion of ranks, belonging to individuals, and also by assuming that a player's winning probability against another is a function of their rank difference. Following this approach, we will see how our simple prediction in equation (1) can be reproduced on average in the individual-based statistics.
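The single-tournament argument behind equation (1) can be checked by direct counting. The following is a minimal sketch, assuming an idealised bracket with $N = 2^m$ contestants and prizes $k_n = z^n$; the function names are illustrative and do not come from the original work. In such a bracket the fraction of players with at least $n$ wins is $2^{-n}$, so the log-log slope of the cumulative distribution against the prize should equal $-(\gamma - 1) = -1/\log_2 z$.

```python
import math

def gamma_from_z(z):
    """Exponent of P(k) ~ k^(-gamma) from equation (1); requires z > 1."""
    return 1.0 / math.log2(z) + 1.0

def win_counts(m):
    """Number of players with exactly n wins in a bracket of 2**m players."""
    counts = {n: 2 ** (m - n - 1) for n in range(m)}  # eliminated after n wins
    counts[m] = 1                                      # the champion
    return counts

def tail_slope(m, z):
    """Log-log slope of the fraction of players with at least n wins
    plotted against the prize k_n = z**n."""
    counts = win_counts(m)
    total = 2 ** m
    points = []
    for n in range(m + 1):
        at_least_n = sum(c for wins, c in counts.items() if wins >= n)
        points.append((n * math.log(z), math.log(at_least_n / total)))
    (x0, y0), (x1, y1) = points[0], points[-1]
    return (y1 - y0) / (x1 - x0)
```

For example, `tail_slope(12, 4.0)` can be compared with `-(gamma_from_z(4.0) - 1)`; both follow from the same geometric counting, so they should agree up to rounding.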

Results 

Decisiveness of competitiveness. Imagine a tournament with $N = 2^m$ contestants to construct a simple binary tree. Each person is assigned a real number $r$, which we refer to as competitiveness instead of a rank, and we reserve the latter term for denoting an outcome of competition, which may or may not reflect an individual's genuine competitiveness depending on how much luck comes into play. By defining $r$ as a real number, the competitiveness is automatically assumed to be transitive, which means that if contestant A is more competitive than B, who is more competitive than C, then A is also more competitive than C. Since we can always rescale the highest competitiveness as unity and the lowest one as zero without loss of generality, the real number $r$ belongs to the closed interval from zero to one.

Under total uncertainty about the contestants, we may assume as our initial condition that $r$ is uniformly distributed at the starting point. We thus denote the initial probability density of $r$ as $p_0(r) = 1$ with normalisation $\int_0^1 p_0(r)\,dr = 1$.

Then, we introduce a function $f(r, r')$ that defines the probability for a contestant with competitiveness $r$ to defeat another with $r'$. As was done by Deng et al. 1, it can be assumed to be a function of $x = r - r'$ only, and it is plausible in such a case that $f(x)$ is a nondecreasing function of $x \in [-1, 1]$ with $f(x) + f(-x) = 1$. In words, the former condition means that a more competitive player has a higher probability to defeat a less competitive player, whereas the latter condition is merely a simple reflection of the trivial fact that one of the two players must win, irrespective of their values of $r$. Let us check some examples of $f(x)$.
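The two conditions on $f(x)$ are easy to test numerically for any candidate function. The sketch below is illustrative only: it encodes the step function with $f(0) = 1/2$, the constant $f = 1/2$, and the exponential form that the paper introduces later in equation (12), and checks both conditions on a grid. The helper name `is_valid_win_probability`, the grid size and the default width are assumptions made here.

```python
import math

def f_step(x):
    """Perfect resolution: Heaviside step with f(0) = 1/2."""
    if x > 0:
        return 1.0
    if x < 0:
        return 0.0
    return 0.5

def f_coin(x):
    """No resolution: every match is a fair coin toss."""
    return 0.5

def f_exp(x, width=0.05):
    """Smoothed step of equation (12); `width` plays the role of Gamma."""
    if x > 0:
        return 1.0 - 0.5 * math.exp(-x / width)
    return 0.5 * math.exp(x / width)

def is_valid_win_probability(f, steps=200, tol=1e-12):
    """Check f(x) + f(-x) = 1 and that f is nondecreasing on [-1, 1]."""
    xs = [-1.0 + 2.0 * i / steps for i in range(steps + 1)]
    symmetric = all(abs(f(x) + f(-x) - 1.0) < tol for x in xs)
    monotone = all(f(a) <= f(b) + tol for a, b in zip(xs, xs[1:]))
    return symmetric and monotone
```

A grid check of this kind cannot prove the conditions, but it catches a function that violates either of them, for example one that decreases with $x$.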



Perfect resolution. One of the simplest choices is

$$f(r, r') = \Theta(r - r'), \tag{2}$$

where $\Theta$ is the Heaviside step function. This means that the competitiveness decides the outcome deterministically. In Methods, we have derived the following nonlinear recursive relation

$$p_{n+1}(r) = 2 p_n(r) \int_0^1 dr'\, f(r, r')\, p_n(r'), \tag{3}$$

where $p_n(r)$ means the distribution of $r$ after the $n$th round. With the Heaviside step function, this equation is solvable for arbitrary $n$ and we obtain

$$p_n(r) = 2^n r^{2^n - 1} \tag{4}$$

with a corresponding cumulative distribution $c_n(r) \equiv \int_0^r p_n(r')\,dr' = r^{2^n}$. As explained in Methods, $c_n(r)$ is identical to the winning chance for the contestant with $r$ at the $(n + 1)$th round, denoted by $w_n(r)$, when we have chosen the step function in equation (2).
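As a sanity check of equations (3) and (4), the recursion can be iterated on a grid. The sketch below is only an illustration: the function names, the grid size and the trapezoidal rule are choices made here, not part of the original analysis. For the step function the integral in equation (3) reduces to the cumulative distribution $c_n(r)$, which is what the code accumulates.

```python
def iterate_step_recursion(rounds, grid=2001):
    """Iterate p_{n+1}(r) = 2 p_n(r) c_n(r) on a uniform grid over [0, 1]."""
    h = 1.0 / (grid - 1)
    rs = [i * h for i in range(grid)]
    p = [1.0] * grid                       # p_0(r) = 1
    for _ in range(rounds):
        # cumulative distribution c_n(r) by the trapezoidal rule
        c = [0.0] * grid
        for i in range(1, grid):
            c[i] = c[i - 1] + 0.5 * h * (p[i - 1] + p[i])
        p = [2.0 * pi * ci for pi, ci in zip(p, c)]
    return rs, p

def closed_form(n, r):
    """Equation (4): p_n(r) = 2^n r^(2^n - 1)."""
    return 2 ** n * r ** (2 ** n - 1)
```

Comparing `iterate_step_recursion(3)` with `closed_form(3, r)` on the same grid should show agreement up to the discretisation error of the trapezoidal rule.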

We can extract various useful information from this probability density function. For example, the average competitiveness after the $n$th round is

$$\langle r \rangle_n = \int_0^1 dr\, r\, p_n(r) = \frac{1}{1 + 2^{-n}}, \tag{5}$$

and therefore the width of $p_n(r)$ decreases as $\sigma \sim 2^{-n}$. A contestant with $r$ passes the $n$th round but not the next one with probability

$$q_n(r) \equiv \left[\prod_{k=0}^{n-1} w_k(r)\right] \left[1 - w_n(r)\right] = r^{2^n - 1}\left(1 - r^{2^n}\right), \tag{6}$$

where we have used $w_k = c_k$, and the sum over $n$ is normalised to unity for any $r$ between zero and one. The average prize money for this person with $r$ can thus be calculated as

$$k(r) = \sum_{n=0}^{\infty} k_n q_n(r). \tag{7}$$
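Equations (6) and (7) lend themselves to a quick numerical check. The sketch below, with illustrative function names and an assumed truncation `n_max`, evaluates $q_n(r)$ and the truncated sum for $k_n = z^n$. Writing $u = r^{2^n}$, equation (6) reads $q_n = u(1 - u)/r$, so the largest term sits where $u$ is closest to $1/2$.

```python
def q(n, r):
    """Equation (6): probability of winning exactly n rounds (perfect resolution)."""
    return r ** (2 ** n - 1) * (1.0 - r ** (2 ** n))

def mean_prize(r, z, n_max=40):
    """Equation (7) truncated at n_max, with the prize k_n = z**n."""
    return sum(z ** n * q(n, r) for n in range(n_max + 1))

def peak_round(r, n_max=40):
    """Round index n at which q(n, r) is largest."""
    return max(range(n_max + 1), key=lambda n: q(n, r))
```

Since the terms telescope, `sum(q(n, r) for n in range(41))` should be very close to one for any $r$ well inside the unit interval. For $z = 4$, reducing $1 - r$ by a factor of four should multiply `mean_prize` by roughly $z^2 = 16$, in line with the scaling derived in the text that follows.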

As shown in Fig. 2, $q_n$ has a peak at $n^* = \log_2\left(-\frac{1}{\log_2 r}\right)$ and the summation in equation (7) can be approximated as

$$k(r) \approx k_{n^*} q_{n^*} = \frac{k_{n^*}}{4r}. \tag{8}$$

If $k_n = z^n$, it means that

$$k(r) \approx \frac{1}{4} z^{n^*} \propto \left(-\frac{1}{\ln r}\right)^{\log_2 z} \approx (1 - r)^{-\log_2 z}$$

in the vicinity of $r = 1$. Note that we have approximated $r$ as unity in the denominator of equation (8). Therefore, Zipf's plot shows a power law with slope $-\log_2 z$, leading to $P(k) \sim k^{-\gamma}$ with $\gamma = (\log_2 z)^{-1} + 1$ due to the relationship between Zipf's plot and $P(k)$ 12. This exactly coincides with equation (1) derived for a single tournament. We have numerically performed tournaments and the results confirm the validity of our analysis as shown in Fig. 3, where the numerical calculations of $c_5(r)$ and $\langle r \rangle_n$ agree perfectly with the analytic results. The detailed procedure of our simulation is explained in Methods.

Figure 2 | Conditional probability to progress only to the $n$th round for given competitiveness $r$ [see equation (6)].

Figure 3 | (a) Probability distribution of $r$ at the 5th round when $f(x)$ is the Heaviside step function, equation (2). The data points are obtained numerically by simulating $10^4$ tournaments with $N = 2^{12}$ and the line shows our analytic prediction in equation (4). (b) Average value of $r$ at the $n$th round, where the data points are obtained numerically and the line represents equation (5).

Figure 4 | Cumulative distribution of prize money, where the horizontal axis is rescaled with respect to the largest value. The data points are obtained numerically by simulating $10^4$ tournaments with $N = 2^{12}$ and $z = 2$, in ascending order of $\Gamma$ from below. The straight line shows our analytic prediction for $\Gamma = 0$ for comparison.

Imperfect resolution. As an opposite extreme case, let us consider a situation where individual competitiveness is totally irrelevant to the outcome of a match and only luck decides. In other words, we assume a constant function $f(x) = 1/2$. If we start from $p_0 = 1$, the winning chance here is $w_0(r) = \int_0^1 dr'\, f(r, r')\, p_0(r') = 1/2$. Note that $w_0$ is not identical to the cumulative distribution any more. The next round has a distribution $p_1(r) = 2 w_0(r) p_0(r) = 1$, and this pattern is repeated all the way, leading to $p_n(r) = 1$ for every $n$. It is also straightforward to obtain the same result by substituting the constant $f(x) = 1/2$ into the recursive equation (3). The resulting $P(k)$ is just the most likely distribution of the prize money among the $N$ players, so the maximum entropy principle tells us to maximise



$$H = -\sum_k P(k) \ln P(k) - \mu \sum_k k P(k), \tag{9}$$

where the first term is the Shannon entropy and $\mu$ represents a Lagrange multiplier for constraining the average prize money. When $H$ is maximised, it does not change under variation in $P(k)$ to the first order, and we thus have

$$0 = \delta H = -\sum_k \delta P(k) \left[1 + \ln P(k) + \mu k\right], \tag{10}$$

which leads us to $P(k) \sim \exp(-k/k_c)$ with a characteristic scale $k_c = 1/\mu$.

This implies a tendency that $P(k)$ usually exhibits a power law with an exponent close to unity but that randomness makes the tail shorter. Suppose that $f(x)$ has a finite resolving power, quantified by a characteristic width $\Gamma$ over which $f(x)$ rapidly increases. The Heaviside step function corresponds to the limiting case of $\Gamma \to 0$. We can predict the following when $\Gamma$ is finite but sufficiently small: at the beginning of the competition, the width $\sigma$ of $p_n(r)$ is much greater than $\Gamma$, so $f(x)$ effectively serves as a step function. The above analysis shows that $\sigma$ decreases as $2^{-n}$, so it becomes comparable with $\Gamma$ after $\nu \sim \log_2(1/\Gamma)$ rounds. Thereafter, the decrease of $\sigma$ slows down. Finally, when $\sigma \lesssim \Gamma$ after many rounds, the survivors' competitiveness is irrelevant and the outcomes are mostly determined by pure luck. Therefore, a natural guess for $P(k)$ would be

$$P(k) \sim k^{-\gamma} \exp(-k/k_\Gamma), \tag{11}$$

with $k_\Gamma \sim O(z^\nu)$ and $\gamma$ in equation (1). This functional form is confirmed in our numerical simulations (Fig. 4). This distribution can also be derived from the maximum entropy principle as in equation (10) but with an additional constraint on $\ln k$ 13,14, which corresponds to the total number of fixtures in this context. The above argument can be pursued further by employing the following $f(x)$:



$$f(x) = \begin{cases} 1 - \frac{1}{2} e^{-x/\Gamma} & \text{for } x > 0, \\ \frac{1}{2} e^{x/\Gamma} & \text{otherwise,} \end{cases} \tag{12}$$

where the exponential functions make it possible to explicitly evaluate the integral. Then, the winning chance is given as

$$w_0(r) = \int_0^1 dr'\, f(r, r')\, p_0(r') \tag{13}$$

$$= \int_0^r dr'\, f(r, r')\, p_0(r') + \int_r^1 dr'\, f(r, r')\, p_0(r') \tag{14}$$

$$= r + \frac{\Gamma}{2} e^{-r/\Gamma} - \frac{\Gamma}{2} e^{(r - 1)/\Gamma}, \tag{15}$$

which approaches $w_0(r) = r$ as $\Gamma \to 0$ and $w_0(r) = 1/2$ as $\Gamma \to \infty$, as expected. As above, this yields

$$p_1(r) = 2 w_0(r) p_0(r) = 2r + \Gamma e^{-r/\Gamma} - \Gamma e^{(r - 1)/\Gamma}, \tag{16}$$

which is normalised to unity as $\int_0^1 dr\, p_1(r) = 1$. This result is quite suggestive, because equation (16) modifies equation (4) at $n = 1$ by adding $O(\Gamma)$ when $r \lesssim \Gamma$ and subtracting the same amount when $(1 - r) \lesssim \Gamma$ [Fig. 5(a)]. In short, $p_1(r)$ becomes flatter when $r$ is close to 0 or 1. If we take one step further, the low-$r$ correction becomes less important and we find

$$p_2(r) \approx 4r^3 - 6\Gamma r e^{(r - 1)/\Gamma}, \tag{17}$$

where we have left only the dominant correction of $O(\Gamma)$ [Fig. 5(b)]. For general $n$, the result up to the correction of $O(\Gamma)$ is inductively found as

$$p_n(r) \approx 2^n r^{2^n - 1} - 2^{n-1}\left(2^n - 1\right) \Gamma\, r^{2^{n-1} - 1} e^{(r - 1)/\Gamma}. \tag{18}$$

This implies that the finite resolution is most noticeable among highly competitive players with $(1 - r) \lesssim \Gamma$, whereas the story looks similar to the case of perfect resolution when $(1 - r)$ is small but still much larger than $\Gamma$.

Figure 5 | Effects of imperfect resolution. (a) $p_1(r)$, the distribution of competitiveness after the first round and (b) $p_2(r)$ after the second round. The resolution parameter is the width of $f(x)$, which is set to be $\Gamma = 5\%$ here. For comparison, the dotted lines show the cases for $\Gamma = 0$.
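The closed form in equation (15) can be compared with a direct numerical integration of equation (13). The following is a minimal sketch under stated assumptions: `gamma` stands for the resolution width $\Gamma$, the midpoint rule and the number of steps are arbitrary choices, and none of the function names come from the original work.

```python
import math

def f_exp(x, gamma):
    """Win probability of equation (12) with resolution width gamma."""
    if x > 0:
        return 1.0 - 0.5 * math.exp(-x / gamma)
    return 0.5 * math.exp(x / gamma)

def w0_numeric(r, gamma, steps=20000):
    """Midpoint-rule estimate of the integral in equation (13) with p_0 = 1."""
    h = 1.0 / steps
    return h * sum(f_exp(r - (i + 0.5) * h, gamma) for i in range(steps))

def w0_closed(r, gamma):
    """Closed form of equation (15)."""
    return r + 0.5 * gamma * (math.exp(-r / gamma) - math.exp((r - 1.0) / gamma))

def p1_closed(r, gamma):
    """Equation (16): p_1(r) = 2 w_0(r) p_0(r) with p_0 = 1."""
    return 2.0 * w0_closed(r, gamma)
```

With $\Gamma = 0.05$ the two estimates of $w_0(r)$ should agree closely across the unit interval, and `w0_closed` should tend to $r$ for very small `gamma` and to $1/2$ for very large `gamma`.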

Stability of competitiveness. We have assumed that competitiveness is each individual's inherent characteristic, which changes on a much longer time scale than the outcomes of competition, and we relate the latter to ranks. The idea is that although a contestant's rank fluctuates over tournaments, it will correctly reflect her true competitiveness in the long run. Even if the competitiveness may interact with actual tournament results, it will usually be related to a cumulative measure of performance that mainly reflects low-frequency, i.e., long-term behaviour. For example, we have calculated the Kendall tau rank correlation coefficient 15, denoted by $\tau$, to see how the accumulated amounts of prize money change their relative positions between two successive tournaments (Fig. 6). If a certain pair of contestants keep their relative positions, they are said to be concordant, and discordant otherwise. The coefficient $\tau$ is defined as the number of concordant pairs minus that of discordant pairs, divided by the total number of possible pairs. Beginning with the same initial amount of money for every contestant, which is set to zero, we run fifty tournaments in a row, accumulating the prize money for each individual. A contestant's accumulated money from a series of tournaments determines her performance in the next tournament in such a way that $r = (N - i)/(N - 1)$ is assigned to the contestant when she has the $i$th largest accumulated amount. The relative positions of two equal amounts are random. In spite of this variability, the ranks of the accumulated money get stabilised after 20 or 30 tournaments in all the cases considered (Fig. 6), and the resulting $P(k)$ is almost identical to the static-$r$ case for each $\Gamma$.

Still, one may ask what happens if their time scales approach each other so that a current rank directly affects performance at the next tournament, provided that the tournaments are regular events. Even if an individual's rank fluctuates over time, it might still be possible for this correlation between successive tournaments to reproduce the power-law tail part of $P(k)$. In fact, this question is not really well-posed because a knockout tournament leaves many contestants' ranks undetermined except for a few prize winners, and this is the fundamental advantage of a knockout tournament. We nevertheless suppose that a player's competitiveness at the next time step is a nondecreasing function of the current performance, say, $r_{t+1} = R(n_t)$, where $n_t$ is the number of wins in the tournament at time $t$, and $R$ is a nondecreasing function taking values between zero and one. Since $r$ determines how many rounds the contestant can go through, the distribution of $n_{t+1}$ is essentially a function of $n_t$. The situation is actually boring because the same contestant wins the first place all the time, but we may exclude this exceptional contestant from our consideration. We begin by noting that any tournament results in a distribution of $n_t$ as $p_0(n_t) = 2^{-n_t - 1}$, which is the initial distribution of the next tournament at time $t + 1$. The corresponding cumulative portion of contestants with results below $n_t$ is thus $c_0(n_t) = 1 - 2^{-n_t}$. As above, let $f(x)$ be the Heaviside step function with $f(0) = 1/2$, and consider the chance $w_0(n_t)$ to win the first round for a contestant that passed $n_t$ rounds at the previous tournament.

Figure 6 | Behaviour of the Kendall tau rank correlation coefficient for the contestants' performance when each contestant's cumulative prize money determines her competitiveness.

Figure 7 | (a) The horizontal axis means the result of a tournament at time $t$, and the vertical axis means the probability to find a contestant with $n_t$ at the $k$th round of the next tournament at $t + 1$. Note the similarity in shape at $k \gtrsim 4$, which means that $p_k(n_t) \approx U(y)$ with $y = k - n_t$. (b) The conditional probability $q_k(n_t)$ also converges to a certain function $V(y)$ (see text).
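The Kendall tau coefficient described in this section (concordant minus discordant pairs, divided by the number of pairs) can be computed as in the sketch below. This is an illustration and not the authors' code. It assumes the simple convention in which tied pairs count as neither concordant nor discordant, whereas the text breaks ties at random, and it uses a quadratic-time loop that is adequate only for modest numbers of contestants.

```python
from itertools import combinations

def kendall_tau(before, after):
    """(concordant - discordant) / (number of pairs) for two paired sequences,
    e.g. accumulated prize money before and after one more tournament."""
    n = len(before)
    if n != len(after) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    concordant = discordant = 0
    for i, j in combinations(range(n), 2):
        s = (before[i] - before[j]) * (after[i] - after[j])
        if s > 0:
            concordant += 1      # the pair keeps its relative order
        elif s < 0:
            discordant += 1      # the pair swaps its relative order
    return (concordant - discordant) / (n * (n - 1) / 2)
```

A value of 1 means that the ordering of all pairs is preserved between the two tournaments, and a value of -1 means that it is completely reversed.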



Explicitly, $w_0(n_t) = \frac{1}{2} p_0(n_t) + c_0(n_t)$. The first term represents the probability to meet an opponent with the same $n_t$, and the factor of one half originates from $f(0)$. The distribution of $n_t$ at the next round is $p_1(n_t) = 2 w_0(n_t) p_0(n_t)$. We can repeat this procedure to obtain a general expression as

$$c_k(n_t) = \left[1 - \frac{1}{g(n_t)}\right]^{g(k)} \tag{19}$$

with $g(x) = 2^x$. By definition, we have

$$p_k(n_t) = c_k(n_t + 1) - c_k(n_t). \tag{20}$$

If $k$ is not very small, $p_k(n_t)$ converges to a certain function of $y = k - n_t$ with a maximum around $y \sim 0$ [Fig. 7(a)]. The conditional probability to reach $k$ and stop there for given $n_t$ is found as

$$q_k(n_t) = \left[\prod_{j=0}^{k-1} w_j(n_t)\right] \left[1 - w_k(n_t)\right], \tag{21}$$

with $\sum_k q_k(n_t) = 1$ [Fig. 7(b)]. We observe that $q_k(n_t)$ can also be described as a certain function $V(y)$ when $n_t > 3$. Moreover, we find that $\sum_{k=0}^{n_t} q_k(n_t) > 1/2$ for any $n_t$. In other words, the time series $\{n_t > 0\}$ can be roughly described as a biased random walk towards the origin. Since this holds true for anyone, each contestant's average result will be rapidly equalised by the bias, so we predict that the probability distribution $P(k)$ will be narrow. This prediction is well substantiated by numerical results shown in Fig. 8, where $P_>(k)$ is drawn in a semi-log plot. Therefore, in terms of the time scale of competitiveness, the power-law shape of $P(k)$ is observable when competitiveness changes on a time scale much longer than the interval between tournaments.
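The round-by-round bookkeeping of this section can be reproduced with a few lines of code. The sketch below is illustrative, with hypothetical function names: it labels contestants by their previous result $n_t$, starts from $p_0(n_t) = 2^{-n_t - 1}$, and applies the step-function rule with $f(0) = 1/2$. Truncating the list at `n_max` does not affect the entries below it, because each entry depends only on entries with smaller or equal $n_t$.

```python
def iterate_rounds(n_max, rounds):
    """Return [(p_k, c_k, w_k)] for k = 0..rounds, each a list indexed by n_t."""
    p = [2.0 ** -(n + 1) for n in range(n_max + 1)]    # p_0(n_t)
    history = []
    for _ in range(rounds + 1):
        c, acc = [], 0.0
        for pn in p:
            c.append(acc)          # c_k(n_t): survivors with results below n_t
            acc += pn
        # win chance: half the equals plus everyone below
        w = [0.5 * pn + cn for pn, cn in zip(p, c)]
        history.append((p, c, w))
        p = [2.0 * wn * pn for wn, pn in zip(w, p)]    # p_{k+1} = 2 w_k p_k
    return history

def c_closed(k, n):
    """Equation (19): c_k(n_t) = (1 - 2^(-n_t))^(2^k)."""
    return (1.0 - 2.0 ** -n) ** (2 ** k)

def stop_probabilities(n, rounds):
    """q_k(n_t) of equation (21) for k = 0..rounds."""
    qs, survive = [], 1.0
    for _, _, w in iterate_rounds(n, rounds):
        qs.append(survive * (1.0 - w[n]))
        survive *= w[n]
    return qs
```

For instance, `sum(stop_probabilities(n, n))` is the probability that a contestant with previous result `n` wins at most `n` rounds next time, which can be compared with the value 1/2 discussed above.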

Discussion 

In summary, we have investigated statistics resulting from knockout tournaments. It is basically the rules of the game that define competitiveness, so the distribution of prize money is dependent on how much the rules take individual competitiveness as a decisive and stable factor. But other details of the game are found to be irrelevant, and the statistics is universal in this sense. More specifically, if competitiveness is a static parameter and any tiny difference in it can be distinguished by the rules, the distribution is predicted to take a power-law shape $P(k) \sim k^{-\gamma}$ with $\gamma$ close to unity. If the difference is indistinguishable below a certain resolution limit $\Gamma$, we find an exponential cutoff at the tail, whose location is a function of $\Gamma$. We have also argued that the distribution $P(k)$ becomes narrow again when competitiveness changes on a time scale comparable to the interval between tournaments. In this respect, the broad distributions observed across many sports suggest that their rules are already stabilised in such a way that one can readily compare contestants' competitiveness in a consistent way over a long time span and that the result of competition indeed sensitively reflects the difference. Since our analysis relates certain internal parameters of a given tournament such as $z$ and $\Gamma$ to the final distribution of prize money, which is somewhat more easily accessible, it will be an interesting question to verify such detailed relationships directly on empirical grounds.

Methods 

Recursive relation for $p_n(r)$. In case of perfect resolution, i.e., $f(r, r') = \Theta(r - r')$, it is straightforward to obtain the winning chance for the contestant with $r$ at the first round of the tournament as

$$w_0(r) = \int_0^1 dr'\, f(r, r')\, p_0(r') = r, \tag{22}$$

where $p_0(r') = 1$. This happens to be identical to the cumulative distribution $c_0(r)$ and it represents the simple fact that the contestant with $r$ should meet an opponent with $r' < r$ in order to win and progress to the next round. When the first round has been finished, the distribution of the survivors' competitiveness is

$$p_1(r) = 2 w_0(r) p_0(r) = 2r, \tag{23}$$

which is again normalised to unity. The factor of two in front is needed because the number of survivors has become one half of $N$. Note that we have used independence between a player's competitiveness and her opponent's in equation (23), which is the case when the initial condition contains no correlations in competitiveness. As in the first round, the corresponding cumulative distribution,

$$c_1(r) = \int_0^1 dr'\, f(r, r')\, p_1(r') = r^2, \tag{24}$$

is identical to the winning chance $w_1(r)$ at the second round. In the same way, the distribution after the second round is $p_2(r) = 2 w_1(r) p_1(r) = 4 w_1(r) w_0(r) p_0(r) = 4r^3$, and so on. For general $f(r, r')$, we can use essentially the same argument to derive the following nonlinear recursive relation:

$$p_{n+1}(r) = 2 p_n(r) \int_0^1 dr'\, f(r, r')\, p_n(r'), \tag{25}$$

which is explicitly solvable for a few special cases as above.

Figure 8 | Cumulative distribution of prize money, when each contestant's tournament result at time $t$ determines her competitiveness at $t + 1$. The horizontal axis is the rescaled prize money $k$. We have numerically simulated $10^4$ tournaments with $N = 2^{12}$ and $z = 2$. We have used the Heaviside step function as $f(x)$, and this plot has excluded the contestant who always wins the first place.

Numerical procedures. First, we generate a tournament tree with $N = 2^m$ contestants at the terminal nodes and assign to each of them a real random number $r$ inside the unit interval as competitiveness. One may require the minimum and maximum of the random numbers to be strictly zero and one, respectively, but it does not make a visible difference when $N$ is large enough. The resulting uncorrelated random number sequence $\{r_1, r_2, \ldots, r_N\}$ means absence of a seeding process, so number one and number two seeds may face each other in the first round. Second, when two contestants A and B meet with $r_A$ and $r_B$, respectively, we draw a random number $\rho \in [0, 1)$ and choose A as the winner of this fixture if $\rho < f(r_A, r_B)$, and choose B otherwise. This is repeated for every match in this first round, and the winner progresses to the parent node. When we have filled all the parent nodes with $2^{m-1}$ winners, the second round starts among them in the same way as before. As the tournament proceeds round by round, the number of survivors decreases rapidly until the final winner is left alone after the $m$th round. Each player defeated at the $n$th round receives prize money $z^{n-1}$, whereas the final winner acquires $z^m$. When a tournament is over, we start a new one by randomly shuffling $\{r_1, r_2, \ldots, r_N\}$ at the terminal nodes, so that the competitiveness is identified as an individual characteristic preserved across the tournaments. We have performed $10^4$ shuffles, hence the same number of tournaments, to obtain statistical averages for each $r_i$ with $N = 2^{12}$.
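The procedure above can be sketched in a few lines. This is a minimal illustration rather than the authors' implementation: the function names are hypothetical, the win probability is passed as a function of $x = r_A - r_B$, and the bracket is stored as a flat list that is halved after every round instead of an explicit tree.

```python
import random

def step(x):
    """Heaviside step function with f(0) = 1/2 (perfect resolution)."""
    return 1.0 if x > 0 else (0.0 if x < 0 else 0.5)

def play_tournament(r_values, f, z, rng):
    """Run one knockout bracket; return prize money indexed like r_values."""
    n = len(r_values)
    if n < 2 or n & (n - 1):
        raise ValueError("the number of contestants must be a power of two")
    survivors = list(range(n))
    rng.shuffle(survivors)                  # no seeding: random bracket positions
    prizes = [0.0] * n
    wins = 0                                # rounds won so far by every survivor
    while len(survivors) > 1:
        next_round = []
        for a, b in zip(survivors[0::2], survivors[1::2]):
            a_wins = rng.random() < f(r_values[a] - r_values[b])
            winner, loser = (a, b) if a_wins else (b, a)
            prizes[loser] = z ** wins       # defeated at round wins + 1
            next_round.append(winner)
        survivors = next_round
        wins += 1
    prizes[survivors[0]] = z ** wins        # the champion has won all m rounds
    return prizes

def average_prizes(r_values, f, z, tournaments, seed=0):
    """Average prize per contestant over repeated, reshuffled tournaments."""
    rng = random.Random(seed)
    totals = [0.0] * len(r_values)
    for _ in range(tournaments):
        for i, k in enumerate(play_tournament(r_values, f, z, rng)):
            totals[i] += k
    return [t / tournaments for t in totals]
```

As a usage example, `average_prizes([i / 15 for i in range(16)], step, 2.0, 1000)` estimates the average prize of each of 16 evenly spaced contestants under perfect resolution; a smoother `f` can be passed in the same way to explore finite resolution.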